Karaoke processing method and system

ABSTRACT

A karaoke processing method and a karaoke processing system are provided. The method includes: receiving a first antiphonal singing request inputted by a first user; acquiring an accompaniment audio file and a lyrics file of a song from a server based on the first antiphonal singing request, and recording singing performed by the first user based on a part of lyrics by using the accompaniment audio file and the lyrics file, to obtain a first recorded file; determining a first lyrics part based on the first recorded file and generating paragraph information; performing audio mixing processing on the first recorded file and the accompaniment audio file, to obtain a first target file; and sending the first target file and the paragraph information to the server.

This application is the national phase of International PatentApplication No. PCT/CN2016/081651, filed on May 11, 2016, which claimsthe priority to Chinese Patent Application No. 201510280492.9, titled“KARAOKE PROCESSING METHOD AND SYSTEM”, filed with the Chinese StateIntellectual Property Office on May 27, 2015, both of which areincorporated herein by reference in their entireties.

FIELD

The present disclosure relates to the field of audio processing, andparticularly to a karaoke processing method and a karaoke processingsystem.

BACKGROUND

With quality enhancement of people's life, karaoke applications aregradually entering into people's daily life, with increasingly richfunctions.

The karaoke application is a combination of a music player and recodingsoftware, which may not only play the original vocal, but also recordsinging of a user, and mix the recoded singing and an accompaniment toobtain a music file. The user may also upload the above music file ontothe Internet, so that more people can hear his/her song.

Typically, when the user records his/her song, the karaoke applicationprovides an accompaniment audio file to serve as accompaniment forrecording the song of the user. However, in this way, the user can onlysing alone, while cannot have the karaoke experience of singing withanother person.

SUMMARY

A karaoke processing method and a karaoke processing system are providedaccording to the embodiments of the present disclosure, to provide thekaraoke experience of singing together with another person.

A karaoke processing method is provided in a first aspect of theembodiments of the present disclosure, which includes:

receiving, by a first client, a first antiphonal singing requestinputted by a first user, where the first antiphonal singing request isused to request initiating antiphonal singing;

acquiring, by the first client, an accompaniment audio file and a lyricsfile of a song from a server based on the first antiphonal singingrequest, and recording, by the first client, singing performed by thefirst user based on a part of lyrics by using the accompaniment audiofile and the lyrics file, to obtain a first recorded file;

determining a first lyrics part and generating paragraph information bythe first client, where the first lyrics part is the part of lyricsalready sung by the first user, and the paragraph information includesidentification information for identifying the first lyrics part;

performing, by the first client, audio mixing processing on the firstrecorded file and the accompaniment audio file, to obtain a first targetfile; and

sending, by the first client, the first target file and the paragraphinformation to the server.

A karaoke processing system is provided in a second aspect of theembodiments of the present disclosure, which includes a first client anda second client, where

the first client is configured to: receive a first antiphonal singingrequest inputted by a first user, where the first antiphonal singingrequest is used to request to initiating antiphonal singing; acquire anaccompaniment audio file and a lyrics file of a song from a server basedon the first antiphonal singing request, and record singing performed bythe first user based on a part of lyrics by using the accompanimentaudio file and the lyrics file, to obtain a first recorded file;determine a first lyrics part based on the first recorded file andgenerate paragraph information, where the first lyrics part is the partof lyrics already sung by the first user, and the paragraph informationincludes identification information for identifying the first lyricspart; perform audio mixing processing on the first recorded file and theaccompaniment audio file, to obtain a first target file; and send thefirst target file and the paragraph information to the server; and

the second client is configured to acquire the first target file and theparagraph information from the server, and provide a karaoke service toa second user based on the paragraph information with taking the firsttarget file as accompaniment audio.

In the technical solution according to the embodiments of the presentdisclosure, upon receiving the first antiphonal singing request inputtedby the first user, the first client acquires the accompaniment audiofile and the lyrics file of the song from the server based on the firstantiphonal singing request, and records singing performed by the firstuser based on a part of lyrics by using the accompaniment audio file andthe lyrics file to obtain the first recorded file. Then the first clientdetermines a first lyrics part and generates paragraph information,where the first lyrics part is the part of lyrics already sung by thefirst user, and the paragraph information includes identificationinformation for identifying the first lyrics part. Then the first clientperforms audio mixing processing on the first recorded file and theaccompaniment audio file to obtain a first target file, and sends thefirst target file and the paragraph information to the server, so thatthe second client acquires the first target file and the paragraphinformation from the server and provides a karaoke service to the seconduser based on the paragraph information with taking the first targetfile as accompaniment audio. Therefore, the first target file in theembodiments of the present disclosure includes the singing of the firstuser only in a display time of the first lyrics part, and theaccompaniment for the other part of the lyrics than the first lyricspart. In this case, singing may be performed based on the other part ofthe lyrics than the first lyrics part according to the paragraphinformation when the second client takes the first target file asaccompaniment audio. Therefore, it can be realized that the first userperforms singing in the display time of the first lyrics part, and thesecond user performs singing in a display time of the other part of thelyrics than the first lyrics part, thereby providing the karaokeexperience of singing together with another person.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a schematic diagram of a karaoke processing method accordingto an embodiment of the present disclosure;

FIG. 2 is a schematic diagram of a karaoke processing method accordingto another embodiment of the present disclosure;

FIG. 3 is a schematic diagram of a karaoke processing method accordingto another embodiment of the present disclosure;

FIG. 4 is a schematic diagram of a karaoke processing system accordingto an embodiment of the present disclosure; and

FIG. 5 is a schematic structural diagram of a client to which a karaokeprocessing method is applied to according to an embodiment of thepresent disclosure.

DETAILED DESCRIPTION OF THE EMBODIMENTS

A karaoke processing method and a karaoke processing system are providedaccording to the embodiments of the present disclosure, to realize thekaraoke experience of singing together with another person. The karaokeprocessing method and the karaoke processing system are described indetail below.

The technical solutions according to the embodiments of the presentdisclosure are described clearly and completely in conjunction withdrawings in the embodiments of the present disclosure. Apparently, thedescribed embodiments are only a part rather than all of the embodimentsof the present disclosure. All other embodiments obtained by thoseskilled in the art without creative work based on the embodiments of thepresent disclosure fall within the protection scope of the presentdisclosure.

Terms such as “first”, “second”, “third” and “fourth” (if present) inthe specification and claims and the above drawings of the presentdisclosure are used to distinguish similar objects, rather than describea specific order or a sequential order. It should be understood thatdata described with the terms may be exchanged in suitable situations,so that the embodiments described here can be embodied in an order otherthan content shown or described here. In addition, terms such as“include” and “comprise” and any variations thereof are intended tocover non-exclusive “include”, for example, a process, a method, asystem, a product or a device including a series of steps or units notonly includes the steps or units listed clearly, but may also includeother steps or units not listed clearly or inherent to the process, themethod, the product or the device.

It should be noted that, the client in the embodiments of the presentdisclosure may include a phone, a personal computer (PC) and/or a panelcomputer or other device. In actual implementation, a karaoke servicemay be accessed by logging onto a webpage through the client, or loadinga program on the client, or using applications (App) published throughvarious mobile platforms.

In the embodiments of the present disclosure, a server is connected tothe client via a network. The server may be constructed by a serviceprovider (for example, a karaoke service provider) for managing andoperating a service provided by the service provider, and the server mayinclude various servers.

Referring to FIG. 1, a karaoke processing method according to anembodiment of the present disclosure includes steps 101 to 105.

In step 101, a first client receives a first antiphonal singing requestinputted by a first user.

In the embodiment, the first antiphonal singing request is used torequest initiating antiphonal singing, to trigger an antiphonal singinginitiating flow. For example, in practice, the first client may beprovided with a button corresponding to the first antiphonal singingrequest. The first client receives a first antiphonal singing requestwhen the first user presses the button.

In step 102, the first client acquires an accompaniment audio file and alyrics file of a song from a server based on the first antiphonalsinging request, and records singing performed by the first user toobtain a first recoded file. The singing is performed by the first userbased on a part of lyrics by using the accompaniment audio file and thelyrics file.

In the embodiment, upon receiving the first antiphonal singing request,the first client acquires an accompaniment audio file and a lyrics fileof a song from a server. The accompaniment audio file and an originalaudio file are raw audio data. In practice, if the acquiredaccompaniment audio file and the original audio file are audio data in acompressed format, such as the mp3 format, the first client decompressesthe audio data in the compressed format into raw audio data. The lyricsfile includes time information for identifying a display time of lyrics,to facilitate synchronizing the lyrics file with the accompaniment audiofile based on the time information. For example, the lyrics file mayinclude an LRC lyrics file, which identifies a display time of thelyrics through a time tag included therein.

In the embodiment, after acquiring the accompaniment audio file and thelyrics file, the client plays the accompaniment audio file and thelyrics file, and the first user may select any part of the lyrics tosing. In practice, the first user does not sing all of the lyrics.Synchronously the client records the singing performed by the first userbased on the part of the lyrics by using the accompaniment audio fileand the lyrics file, to obtain the first recorded file.

In step 103, the first client determines a first lyrics part based onthe first recorded file, and generates paragraph information.

In the embodiment, the first lyrics part refers to the part of lyricsalready sung by the first user. That is, based on the first recordedfile, the first client recognizes the part of lyrics already sung by thefirst user and generates paragraph information. The paragraphinformation includes identification information for identifying thefirst lyrics part, to mark the part of lyrics already sung by the firstuser.

Optionally, in the embodiment, the identification information mayinclude a display time corresponding to the first lyrics part. Forexample, a starting time of the first lyrics part is at 10 s, and anending time of the first lyrics part is at 15 s, and the like.Alternatively, the identification information may include a sentenceidentifier for the first lyrics part. For example, the first lyrics partincludes the first sentence, the third sentence, the fifth sentence andthe like. A particular form of the identification information is notlimited here, as long as the first client can recognize the first lyricspart based on the identification.

In the embodiment, the first client may determine the first lyrics partin multiple manners. For example, the first client may determine thepart of lyrics already sung by the first user through effectiveness ofvoice in the first recorded file.

For example, the first client determines whether recorded voice of thefirst user corresponding to a lyrics part is effective, and determinesthe lyrics part as the first lyrics part in a case that the recordedvoice of the first user corresponding to the lyrics part is effective.

Optionally, in the embodiment, the first client determines whether thevolume of the recorded voice of the first user corresponding to thelyrics part is greater than a first preset value, and determines thatthe recorded voice is effective in a case that the volume of therecorded voice is greater than the first preset value. The aboveprocesses may be performed by the VAD algorithm or an improved algorithmthereof

Preferably, after determining that the recorded voice of the first usercorresponding to the lyrics part is effective, the first client furtherdetermines whether a time occupied by the recorded voice is consistentwith a display time corresponding to the lyrics part, and determines thelyrics part as the first lyrics part in a case that the time occupied bythe recorded voice is consistent with the display time corresponding tothe lyrics part.

For example, the determination is based on sentences in the lyrics.After it is determined that recorded voice corresponding to a sentencein the lyrics is effective, the sentence of the lyrics is determined asthe first lyrics part in a case that a time occupied by the recordedvoice corresponding to the sentence of the lyrics is from 20 s to 30 sand a display time corresponding to the sentence of the lyrics is alsofrom 20 s to 30 s. A threshold may be set here. In a case that acoincidence ratio between a time occupied by the recorded voicecorresponding to the sentence of the lyrics and a display timecorresponding to the sentence of the lyrics is greater than thethreshold, it is determined that the time occupied by the recorded voicecorresponding to the sentence of the lyrics is consistent with thedisplay time corresponding to the sentence of the lyrics.

It may be understood that the first lyrics part may be determined inother manners in the embodiment, which is not limited here. In practice,the first client may determine the first lyrics part by a real-timeaudio detecting module using the above determination manners.

In step 104, the first client performs audio mixing processing on thefirst recorded file and the accompaniment audio file, to obtain a firsttarget file.

In the embodiment, the audio mixing processing may be performed by anexisting audio mixing algorithm of, for example, adding data ofcorresponding sampling points, and then dividing the sum by two.

Optionally, in the embodiment, in order to prevent the first recordedfile from further including some noise or voice determined to beineffective for the other part of the lyrics than the first lyrics part,before step 104 of the karaoke processing method, the first client mayclear audio data corresponding to the other part of the lyrics than thefirst lyrics part in the first recorded file based on the paragraphinformation. In this way, it can be ensured that the obtained firstrecorded file includes audio data only in the display time of the firstlyrics part, which is equivalent to only including effective voice ofthe singing of the first user.

In step 105, the first client sends the first target file and theparagraph information to the server, so that a second client acquiresthe first target file and the paragraph information from the server andprovides a karaoke service to a second user based on the paragraphinformation with taking the first target file as accompaniment audio.

In the embodiment, the second client may provide a karaoke service to asecond user based on the paragraph information with taking the firsttarget file as accompaniment audio after acquiring the first target fileand the paragraph information from the server. The second user maydetermine the lyrics part that is already sung based on the paragraphinformation, and perform singing based on the lyrics that is not sungyet.

Optionally, in the embodiment, the first client may generate aconfiguration file. The configuration file includes a sound effectparameter used by the first user. The first client sends theconfiguration file to the server, so that the second client provides akaraoke service to the second user according to the paragraphinformation and the configuration file with taking the first target fileas accompaniment audio. Optionally, the above sound effect parameterincludes a reverberation parameter, a sound-modifying and tone-modifyingparameter, a voice-beautifying parameter and the like.

In the technical solution according to the embodiment of the presentdisclosure, upon receiving a first antiphonal singing request inputtedby the first user, the first client acquires an accompaniment audio fileand a lyrics file of a song from the server based on the firstantiphonal singing request, and records singing performed by the firstuser based on a part of lyrics by using the accompaniment audio file andthe lyrics file to obtain a first recorded file. Then the first clientdetermines a first lyrics part and generates paragraph information,where the first lyrics part refers to the part of lyrics already sung bythe first user, and the paragraph information includes identificationinformation for identifying the first lyrics part. Then the first clientperforms audio mixing processing on the first recorded file and theaccompaniment audio file to obtain a first target file, and sends thefirst target file and the paragraph information to the server, so thatthe second client acquires the first target file and the paragraphinformation from the server and provides a karaoke service to the seconduser based on the paragraph information with taking the first targetfile as accompaniment audio. The first target file in the embodiments ofthe present disclosure includes the singing of the first user only inthe display time of the first lyrics part, and includes an accompanimentfor the other part of the lyrics than the first lyrics part. In thiscase, singing may be performed based on the other part of the lyricsthan the first lyrics part according to the paragraph information whenthe second client takes the first target file as accompaniment audio.Therefore, it can be realized that the first user performs singing inthe display time of the first lyrics part, and the second user performssinging in the display time of the other part of the lyrics than thefirst lyrics part, thereby providing the karaoke experience of singingtogether with another person.

On the basis of the embodiment shown in FIG. 1, it is further describedbelow how the second client provides a karaoke service to the seconduser based on the paragraph information with taking the first targetfile as accompaniment audio to provide the karaoke experience of singingtogether by two users. Referring to FIG. 2, a karaoke processing methodaccording to another embodiment of the present disclosure includes steps201 to 207.

In step 201, a first client receives a first antiphonal singing requestinputted by a first user.

In step 202, the first client acquires an accompaniment audio file and alyrics file of a song from a server based on the first antiphonalsinging request, and records singing performed by the first user toobtain a first recorded file. The singing is performed by the first userbased on a part of lyrics by using the accompaniment audio file and thelyrics file.

In step 203, the first client determines a first lyrics part based onthe first recorded file, and generates paragraph information.

In step 204, the first client performs audio mixing processing on thefirst recorded file and the accompaniment audio file, to obtain a firsttarget file.

In step 205, the first client sends the first target file and theparagraph information to the server, so that a second client acquiresthe first target file and the paragraph information from the server andprovides a karaoke service to a second user based on the paragraphinformation with taking the first target file as accompaniment audio.

In the embodiment, the processes of step 201 to step 205 are the same asthose of step 101 to step 105 in the embodiment shown in FIG. 1, whichare not described repeatedly here.

In step 206, the second client receives a second antiphonal singingrequest sent by the second user.

In the embodiment, the second antiphonal singing request is used torequest joining the antiphonal singing, to trigger an antiphonal joiningflow. For example, in practice, the second client may be provided with abutton corresponding to the second antiphonal singing request, and thesecond client receives a second antiphonal singing request when thesecond user presses the button.

In step 207, the second client acquires the first target file, thelyrics file and the paragraph information of the song from the serverbased on the second antiphonal singing request, and record singingperformed by the second user to obtain a second recorded file. Thesinging is performed by the second user based on all of the other partof the lyrics than the first lyrics part according to the paragraphinformation with taking the first target file as accompaniment audio.

In the embodiment, the second client acquires the first target file, thelyrics file and the paragraph information from the server upon receivingthe second antiphonal singing request, and plays the first target fileand the lyrics file. The second user performs singing based on all ofthe other part of the lyrics than the first lyrics part, andsynchronously the second client records the singing performed by thesecond user based on all of the other part of the lyrics than the firstlyrics part according to the paragraph information with taking the firsttarget file as accompaniment audio, to obtain a second recorded file.

Optionally, in the embodiment, in a case that a configuration file issent to the server by the first client and the server further includesthe configuration file, in step 207 the second client may acquire thefirst target file, the lyrics file, the paragraph information and theconfiguration file of the song from the server, and record singingperformed by the second user based on all of the other part of thelyrics than the first lyrics part according to the paragraph informationand the configuration file with taking the first target file asaccompaniment audio, to obtain the second recorded file. In practice,the client performs configuration based on the sound effect parameter inthe configuration file, and presents the configuration to the seconduser to be selected by the second user. The second user may also modifythe parameter configuration based on actual needs.

In step 208, the second client performs audio mixing processing on thesecond recorded file and the first target file, to obtain a secondtarget file.

In the embodiment, a process of the audio mixing processing may beperformed by an existing audio mixing algorithm of, for example, addingdata of corresponding sampling points and dividing the sum by two.

It may be understood that, the second target file in the embodimentincludes the singing of the first user corresponding to the first lyricspart, and the singing of the second user corresponding to all of theother part of the lyrics than the first lyrics part, thereby providing akaraoke experience of singing together by two users.

Optionally, in the embodiment, in order to prevent the second recordedfile from further including some noise or voice determined to beineffective in the first lyrics part, before step 208 of the karaokeprocessing method, the second client may clear audio data correspondingto the first lyrics part in the first recorded file based on theparagraph information.

On the basis of the embodiment shown in FIG. 1, it is further describedhow the second client provides a karaoke service to the second userbased on the paragraph information with taking the first target file asaccompaniment audio, to provide the karaoke experience of singingtogether by multiple users (more than two users). Referring to FIG. 3, akaraoke processing method according to another embodiment of the presentdisclosure includes steps 301 to 310.

In step 301, a first client receives a first antiphonal singing requestinputted by a first user.

In step 302, the first client acquires an accompaniment audio file and alyrics file of a song from a server based on the first antiphonalsinging request, and records singing performed by a first user to obtaina first recorded file. The singing is performed by a first user based ona part of lyrics by using the accompaniment audio file and the lyricsfile.

In step 303, the first client determines a first lyrics part based onthe first recorded file, and generates paragraph information.

In step 304, the first client performs audio mixing processing on thefirst recorded file and the accompaniment audio file, to obtain a firsttarget file.

In step 305, the first client sends the first target file and theparagraph information to the server, so that a second client acquiresthe first target file and the paragraph information from the server andprovides a karaoke service to a second user based on the paragraphinformation with taking the first target file as accompaniment audio.

In the embodiment, the processes of step 301 to step 305 are the same asthe processes of step 101 to step 105 in the embodiment shown in FIG. 1,which are not described repeatedly here.

In step 306, the second client receives a third antiphonal singingrequest sent by the second user.

In the embodiment, the third antiphonal singing request is used torequest joining the antiphonal singing and indicate to record based ononly a part of the other part of the lyrics than the first lyrics part,to trigger a joining flow in an antiphon initiating mode. For example,in practice, the second client may be provided with a buttoncorresponding to the third antiphonal singing request, and the secondclient receives a third antiphonal singing request when the second userpresses the button.

In step 307, the second client acquires the first target file, thelyrics file and the paragraph information of the song from the serverbased on the third antiphonal singing request, and records singingperformed by the second user to obtain a third recorded file. Thesinging is performed by the second user based on a part of the otherpart of the lyrics than the first lyrics part according to the paragraphinformation with taking the first target file as accompaniment audio.

In the embodiment, upon receiving the third antiphonal singing request,the second clients acquires the first target file, the lyrics file andthe paragraph information from the server, and plays the first targetfile and the lyrics file. The second user performs singing based on apart of the other part of the lyrics than the first lyrics part, andsynchronously the second client records singing performed by the seconduser based on the part of the other part of the lyrics than the firstlyrics part according to the paragraph information with taking the firsttarget file as accompaniment audio, to obtain the third recorded file.It should be noted that, the embodiment is different from the embodimentshown in FIG. 2 in that the second user performs singing based on a partof the other part of the lyrics than the first lyrics part, instead ofall of the other part of the lyrics than the first lyrics part.Therefore, the song includes a part to be used by another user joiningthe antiphonal singing.

In step 308, the second client determines a second lyrics part, and addsidentification information for identifying the second lyrics part in theparagraph information.

The second lyrics part refers to the part of the lyrics sung by thesecond user. For the method for determining the second lyrics part and aform of the identification information, one may refer to the descriptionin step 103 in the embodiment shown in FIG. 1, which is not describedrepeatedly here.

In step 309, the second client performs audio mixing processing on thethird recorded file and the first target file, to obtain a third targetfile.

In the embodiment, a process of the audio mixing processing may beperformed by an existing audio mixing algorithm of, for example, addingdata of corresponding sampling points and dividing the sum by two.

Optionally, in the embodiment, in order to prevent the third recordedfile from further including some noise or voice determined to beineffective in the other part of the lyrics than the second lyrics part,before step 309 of the karaoke processing method, the second client mayfurther clear audio data corresponding to the other part of the lyricsthan the second lyrics part in the third recorded file based on theparagraph information. In this way, it can be ensured that the obtainedthird recorded file includes audio data only in the display time of thesecond lyrics part, which is equivalent to only including effectivevoice of the singing of the second user.

In step 310, the second client sends the third target file and theparagraph information to the server, so that a third client, through theserver, records singing performed by a third user based on a lyrics partother than the first lyrics part and the second lyrics part according tothe paragraph information with taking the third target file asaccompaniment audio.

In the embodiment, the third client acquires the third target file andthe paragraph information from the server, and may provide a karaokeservice to the third user based on the paragraph information with takingthe third target file as accompaniment audio. The third user maydetermine which part of the lyrics is already sung (including the firstlyrics part already sung by the first user and the third lyrics partalready sung by the second user), and perform singing based on a lyricspart that is not been sung yet.

Optionally, in the embodiment, the second client may further add a soundeffect parameter used by the second user in the configuration file, sothat the third client provides a karaoke service to the second userbased on the paragraph information and the configuration file withtaking the third target file as accompaniment audio. Optionally, thesound effect parameter includes a reverberation parameter, asound-modifying and tone-modifying parameter, a voice-beautifyingparameter and the like.

It may be understood that, the third target file in the embodimentincludes the singing performed by the first user based on the firstlyrics part and the singing performed by the second user based on thesecond lyrics part, and one or more other users may perform singingbased on a lyrics part other than the first lyrics part and the secondlyrics part, thereby providing karaoke experience of singing together bymultiple users.

The karaoke processing method according to the embodiment of the presentdisclosure is described above, and a karaoke processing system accordingto an embodiment of the present disclosure is described below. Referringto FIG. 4, the karaoke processing system according to the embodiment ofthe present disclosure includes a first client and a second client.

The first client 401 is configured to: receive a first antiphonalsinging request inputted by a first user, where the first antiphonalsinging request is used to request initiating antiphonal singing;acquire an accompaniment audio file and a lyrics file of a song from aserver based on the first antiphonal singing request, and record singingperformed by the first user to obtain a first recoded file, where thesinging is performed by the first user based on a part of lyrics byusing the accompaniment audio file and the lyrics file; determine afirst lyrics part based on the first recorded file, and generateparagraph information, where the first lyrics part refers to the part oflyrics already sung by the first user, and the paragraph informationincludes identification information for identifying the first lyricspart; perform audio mixing processing on the first recorded file and theaccompaniment audio file to obtain a first target file; and send thefirst target file and the paragraph information to the server.

The second client 402 is configured to acquire the first target file andthe paragraph information from the server, and provide a karaoke serviceto a second user based on the paragraph information with taking thefirst target file as accompaniment audio.

Optionally, in the embodiment, the first client 401 may be configured todetermine whether recorded voice of the first user corresponding to alyrics part is effective, and determine the lyrics part as the firstlyrics part in a case that the recorded voice of the first usercorresponding to the lyrics part is effective.

Optionally, in the embodiment, the first client 401 may be configured todetermine whether the volume of the recorded voice of the first usercorresponding to the lyrics part is greater than a preset value, anddetermine that the recorded voice is effective in a case that the volumeof the recorded voice is greater than the preset value.

Optionally, in the embodiment, the first client 401 may be configured todetermine whether recorded voice of the first user corresponding to alyrics part is effective, and further determine whether a time occupiedby the recorded voice is consistent with a display time corresponding tothe lyrics part in a case that the recorded voice of the first usercorresponding to the lyrics part is effective, and determine the lyricspart as the first lyrics part in a case that the time occupied by therecorded voice is consistent with the display time corresponding to thelyrics part.

Optionally, in the embodiment, the first client 401 may be furtherconfigured to clear audio data corresponding to the other part of thelyrics than the first lyrics part in the first recorded file based onthe paragraph information before performing audio mixing processing onthe first recorded file and the accompaniment audio file to obtain thefirst target file.

Optionally, in the embodiment, the identification information includes adisplay time corresponding to the first lyrics part or a sentenceidentifier of the first lyrics part.

Optionally, in the embodiment, the first client 401 may be furtherconfigured to generate a configuration file before sending the firsttarget file and the paragraph information to the server, and send theconfiguration file to the server. The configuration file includes asound effect parameter used by the first user.

Optionally, in the embodiment, the second client 402 may be configuredto: receive a second antiphonal singing request sent by a second user,where the second antiphonal singing request is used to request joiningthe antiphonal singing; acquire the first target file, the lyrics fileand the paragraph information of the song from the server based on thesecond antiphonal singing request, and record singing performed by thesecond user to obtain a second recorded file, where the singing isperformed by the second user based on all of the other part of thelyrics than the first lyrics part according to the paragraph informationwith taking the first target file as accompaniment audio; and performaudio mixing processing on the second recorded file and the first targetfile to obtain a second target file.

Optionally, in the embodiment, the second client 402 may be furtherconfigured to clear audio data corresponding to the first lyrics part inthe second recorded file based on the paragraph information beforeperforming audio mixing processing on the second recorded file and thefirst target file to obtain the second target file.

Optionally, in the embodiment, the second client 402 may be configuredto, in a case that the server further includes the configuration file,acquire the first target file, the lyrics file, the paragraphinformation and the configuration file of the song from the server, andrecord singing performed by the second user based on all of the otherpart of the lyrics than the first lyrics part according to the paragraphinformation and the configuration file with taking the first target fileas accompaniment audio, to obtain the second recorded file.

Optionally, in the embodiment, the second client may be configured to:receive a third antiphonal singing request sent by the second user,where the third antiphonal singing request is used to request joiningthe antiphonal singing and indicate to record based on only a part ofthe other part of the lyrics than the first lyrics part; acquire thefirst target file, the lyrics file and the paragraph information of thesong from the server based on the third antiphonal singing request, andrecord singing performed by the second user to obtain a third recordedfile, where the singing is performed by the second user based on thepart of the other part of the lyrics than the first lyrics partaccording to the paragraph information with taking the first target fileas accompaniment audio; determine a second lyrics part, and addidentification information for identifying the second lyrics part in theparagraph information, where the second lyrics part refers to the lyricspart already sung by the second user; perform audio mixing processing onthe third recorded file and the first target file, to obtain a thirdtarget file; and send the third target file and the paragraphinformation to the server, so that a third client, through the server,records singing performed by a third user based on a lyrics part otherthan the first lyrics part and the second lyrics part according to theparagraph information with taking the third target file as accompanimentaudio.

It may be clearly known by those skilled in the art that for convenienceand conciseness of description, operating processes of the system, thedevice and the unit described above are not described repeatedly here,and one may refer to corresponding processes in the method embodimentsdescribed above for details.

Reference is made to FIG. 5, which is a schematic structural diagram ofa client to which the karaoke processing method is applied according toan embodiment of the present disclosure. The client in the embodiment isimplemented as a computer or a similar operating apparatus. In practice,the client may also be a phone, a panel device or other device.

As shown in FIG. 5, a computing apparatus 500 includes one or moreprocessors 502 (only one processor is shown in FIG. 5), a memory 504 anda transmission module 506. It may be understood by those skilled in theart that the structure shown in FIG. 5 is only for illustration, and isnot intended to limit the structure of the above electronic apparatus.For example, the computing apparatus 500 may further include componentsmore or less than components shown in FIG. 5, or have a differentconfiguration from FIG. 5.

The memory 504 may be configured to store a software program and module,for example, program instructions/modules corresponding to the karaokeprocessing method in the embodiment of the present disclosure. Theprocessor 502 executes various function applications and data processingby running the software program and module stored in the memory 504, toimplement the above karaoke processing method. The memory 504 mayinclude a high-speed random memory, and may also include a non-volatilememory, such as one or more magnetic storage, a flash memory or othernon-volatile solid-state memory. In some embodiments, the memory 504 mayfurther include a memory arranged remotely with respect to the processor502, and the remote memory may be connected to the apparatus 500 via anetwork. The instance of the above network may include but is notlimited to the Internet, an intranet, a local network, a mobilecommunication network and a combination thereof.

The transmission module 506 is configured to receive or transmit datavia a network. The above network may include for example a wired networkand a wireless network. In an example, the transmission module 506includes a network interface controller (NIC), which may be connected toanother network device or a router via a network cable, to communicatewith the Internet. In an example, the transmission module 506 is a radiofrequency (RF) module, which is used to communicate with the Internet ina wireless manner.

It should be understood that, according to the embodiments of thepresent disclosure, the disclosed system, apparatus and methods may beimplemented in other ways. For example, the described apparatusembodiment is merely for illustration. The units are divided merelybased on logical functions, and the units may be divided with otherdivision manner in practice. For example, multiple units or modules maybe combined, or may be integrated into another system, or some featuresmay be omitted or not be implemented. In addition, the displayed ordiscussed couplings, direct couplings or communication connections maybe implemented as indirect couplings or communication connections viasome interfaces, devices or units, which may be electrical, mechanicalor in other forms.

The units described as separate components may be or not be separatedphysically. The components shown as units may be or not be physicalunits, i.e., the units may be located at one place or may be distributedonto multiple network units. All of or part of the units may be selectedbased on actual needs to implement the solutions according to theembodiments.

In addition, function units according to the embodiments of the presentdisclosure may be integrated in one processing unit, or the units mayexist separately, or two or more units may be integrated in one unit.The integrated unit may be implemented in a form of hardware or asoftware function unit.

If the integrated units are implemented in the form of software functionunit and the software function unit is sold or used as separateproducts, the software function unit may also be stored in a computerreadable storage medium. Based on such understanding, an essential partof the technical solutions of the present disclosure, i.e., the part ofthe technical solutions of the present disclosure that contribute to theexisting technology, or all or a part of the technical solutions may beembodied in the form of a computer software product. The computersoftware product is stored in a storage medium, and includes severalinstructions for instructing a computer device (which may be a personalcomputer, a server, a network device or the like) to implement all or apart of the steps of the methods according to the embodiments of thepresent disclosure. The foregoing storage medium includes various mediathat can store program codes, for example, a USB disk, a mobile harddisk drive, a Read-Only Memory (ROM), a Random Access Memory (RAM), amagnetic disk, an optical disk and the like.

To sum up, it should be noted that, the above embodiments are only usedto describe the technical solutions of the present disclosure, and notintended to limit the present disclosure. Although the presentdisclosure is illustrated in detail with reference to the embodimentsdescribed above, those skilled in the art should appreciate thatmodifications may be made to the technical solutions of the embodimentsdescribed above, or equivalent substitutions may be made to a part ofthe technical features in the embodiments. Those modifications andsubstitutions would not make the essential of the technical solutionsdeviate from the spirit and the scope of the technical solutions in theembodiments of the present disclosure.

The invention claimed is:
 1. A karaoke processing method, comprising:receiving, by a first client, a first antiphonal singing requestinputted by a first user, wherein the first antiphonal singing requestis used to request initiating antiphonal singing; acquiring, by thefirst client, an accompaniment audio file and a lyrics file of a songfrom a server based on the first antiphonal singing request, andrecording, by the first client, singing performed by the first userbased on a part of lyrics by using the accompaniment audio file and thelyrics file, to obtain a first recorded file; determining, by the firstclient, a first lyrics part based on the first recorded file andgenerating paragraph information, wherein the first lyrics part is thepart of lyrics already sung by the first user, and the paragraphinformation comprises identification information for identifying thefirst lyrics part; performing, by the first client, audio mixingprocessing on the first recorded file and the accompaniment audio file,to obtain a first target file; and sending, by the first client, thefirst target file and the paragraph information to the server.
 2. Thekaraoke processing method according to claim 1, wherein the determining,by the first client, the first lyrics part based on the first recordedfile comprises: determining, by the first client, whether recorded voiceof the first user corresponding to a lyrics part is effective, anddetermining, by the first client, the lyrics part as the first lyricspart in a case that the recorded voice of the first user correspondingto the lyrics part is effective.
 3. The karaoke processing methodaccording to claim 2, wherein the determining, by the first client,whether the recorded voice of the first user corresponding to the lyricspart is effective comprises: determining, by the first client, whethervolume of the voice is greater than a preset value, and determining, bythe first client, that the recorded voice of the first usercorresponding to the lyrics part is effective in a case that the volumeof the voice is greater than the preset value.
 4. The karaoke processingmethod according to claim 1, wherein the determining the first lyricspart based on the first recorded file by the first client comprises:determining, by the first client, whether recorded voice of the firstuser corresponding to a lyrics part is effective, determining, by thefirst client, whether a time occupied by the recorded voice isconsistent with a display time corresponding to the lyrics part in acase that the recorded voice of the first user corresponding to thelyrics part is effective, and determining, by the first client, thelyrics part as the first lyrics part in a case that the time occupied bythe recorded voice is consistent with the display time corresponding tothe lyrics part.
 5. The karaoke processing method according to claim 1,before the performing, by the first client, the audio mixing processingon the first recorded file and the accompaniment audio file to obtainthe first target file, further comprising: clearing, by the firstclient, audio data corresponding to the other part of the lyrics thanthe first lyrics part in the first recorded file based on the paragraphinformation.
 6. The karaoke processing method according to claim 1,wherein the identification information comprises a display timecorresponding to the first lyrics part or a sentence identifier of thefirst lyrics part.
 7. The karaoke processing method according to claim1, before the sending, by the first client, the first target file andthe paragraph information to the server, further comprising: generatinga configuration file by the first client, wherein the configuration filecomprise a sound effect parameter used by the first user; and sendingthe configuration file to the server by the first client.
 8. The karaokeprocessing method according to claim 1, further comprising: acquiring,by a second client, the first target file and the paragraph informationfrom the server, and providing, by the second client, a karaoke serviceto a second user based on the paragraph information with taking thefirst target file as accompaniment audio.
 9. The karaoke processingmethod according to claim 8, wherein the acquiring, by the secondclient, the first target file and the paragraph information from theserver and providing, by the second client, the karaoke service to thesecond user based on the paragraph information with taking the firsttarget file as the accompaniment audio comprises: receiving, by thesecond client, a second antiphonal singing request sent by the seconduser, wherein the second antiphonal singing request is used to requestjoining the antiphonal singing; acquiring, by the second client, thefirst target file, the lyrics file and the paragraph information of thesong from the server based on the second antiphonal singing request, andrecording, by the second client, singing performed by the second userbased on all of the other part of the lyrics than the first lyrics partaccording to the paragraph information with taking the first target fileas accompaniment audio, to obtain a second recorded file; andperforming, by the second client, audio mixing processing on the secondrecorded file and the first target file, to obtain a second target file.10. The karaoke processing method according to claim 9, before theperforming audio mixing processing on the second recorded file and thefirst target file by the second client to obtain the second target file,further comprising: clearing, by the second client, audio datacorresponding to the first lyrics part in the second recorded file basedon the paragraph information.
 11. The karaoke processing methodaccording to claim 9, wherein in a case that the server comprises aconfiguration file, the acquiring, by the second client, the firsttarget file, the lyrics file and the paragraph information of the songfrom the server based on the second antiphonal singing request and therecording, by the second client, singing performed by the second userbased on a second lyrics part according to the paragraph informationwith taking the first target file as the accompaniment audio to obtain asecond recorded file comprises: acquiring, by the second client, thefirst target file, the lyrics file, the paragraph information and theconfiguration file of the song from the server, and recording, by thesecond client, singing performed by the second user based on all oflyrics other than the first lyrics part according to the paragraphinformation and the configuration file with taking the first target fileas accompaniment audio, to obtain the second recorded file.
 12. Thekaraoke processing method according to claim 8, wherein the acquiring,by the second client, the first target file and the paragraphinformation from the server, and providing, by the second client, thekaraoke service to the second user based on the paragraph informationwith taking the first target file as the accompaniment audio comprises:receiving, by the second client, a third antiphonal singing request sentby the second user, wherein the third antiphonal singing request is usedto request joining the antiphonal singing and indicate to record basedon only a part of the other part of the lyrics than the first lyricspart; acquiring, by the second client, the first target file, the lyricsfile and the paragraph information of the song from the server based onthe third antiphonal singing request, and recording, by the secondclient, singing performed by the second user based on the part of theother part of the lyrics than the first lyrics part according to theparagraph information with taking the first target file as accompanimentaudio, to obtain a third recorded file; determining, by the secondclient, a second lyrics part and adding identification information foridentifying the second lyrics part in the paragraph information, whereinthe second lyrics part is the part of the lyrics already sung by thesecond user; performing, by the second client, audio mixing processingon the third recorded file and the first target file, to obtain a thirdtarget file; and sending, by the second client, the third target fileand the paragraph information to the server, wherein a third clientacquires the third target file and the paragraph information from theserver and provides a karaoke service to a third user based on theparagraph information with taking the third target file as accompanimentaudio.
 13. A karaoke processing system, comprising a first client, and asecond client, wherein the first client is configured to: receive afirst antiphonal singing request inputted by a first user, wherein thefirst antiphonal singing request is used to request initiatingantiphonal singing; acquire an accompaniment audio file and a lyricsfile of a song from a server based on the first antiphonal singingrequest, and record singing performed by the first user based on a partof lyrics by using the accompaniment audio file and the lyrics file, toobtain a first recorded file; determine a first lyrics part based on thefirst recorded file and generate paragraph information, wherein thefirst lyrics part is the part of lyrics already sung by the first user,and the paragraph information comprises identification information foridentifying the first lyrics part; perform audio mixing processing onthe first recorded file and the accompaniment audio file, to obtain afirst target file; and send the first target file and the paragraphinformation to the server; and the second client is configured toacquire the first target file and the paragraph information from theserver, and provide a karaoke service to a second user based on theparagraph information with taking the first target file as accompanimentaudio.
 14. The karaoke processing system according to claim 13, whereinthe first client is configured to determine whether recorded voice ofthe first user corresponding to a lyrics part is effective, anddetermine the lyrics part as the first lyrics part in a case that therecorded voice of the first user corresponding to the lyrics part iseffective.
 15. The karaoke processing system according to claim 13,wherein the first client is configured to determine whether recordedvoice of the first user corresponding to a lyrics part is effective,determine whether a time occupied by the recorded voice is consistentwith a display time corresponding to the lyrics part in a case that therecorded voice of the first user corresponding to the lyrics part iseffective, and determine the lyrics part as the first lyrics part in acase that the time occupied by the recorded voice is consistent with thedisplay time corresponding to the lyrics part.
 16. The karaokeprocessing system according to claim 13, wherein the first client isfurther configured to clear audio data corresponding to the other partof the lyrics than the first lyrics part in the first recorded filebased on the paragraph information before performing the audio mixingprocessing on the first recorded file and the accompaniment audio fileto obtain the first target file.
 17. The karaoke processing systemaccording to claim 13, wherein the identification information comprisesa display time corresponding to the first lyrics part or a sentenceidentifier of the first lyrics part.
 18. The karaoke processing systemaccording to claim 13, wherein the first client is further configuredto: generate a configuration file before sending the first target fileand the paragraph information to the server, wherein the configurationfile comprise a sound effect parameter used by the first user; and sendthe configuration file to the server.
 19. The karaoke processing systemaccording to claim 13, wherein the second client is configured to:receive a second antiphonal singing request sent by the second user,wherein the second antiphonal singing request is used to request joiningthe antiphonal singing; acquire the first target file, the lyrics fileand the paragraph information of the song from the server based on thesecond antiphonal singing request, and record singing performed by thesecond user based on all of the other part of the lyrics than the firstlyrics part according to the paragraph information with taking the firsttarget file as accompaniment audio, to obtain a second recorded file;and perform audio mixing processing on the second recorded file and thefirst target file, to obtain a second target file.
 20. The karaokeprocessing system according to claim 13, wherein the second client isconfigured to: receive a third antiphonal singing request sent by thesecond user, wherein the third antiphonal singing request is used torequest joining the antiphonal singing and indicate to record based ononly a part of the other part of the lyrics than the first lyrics part;acquire the first target file, the lyrics file and the paragraphinformation of the song from the server based on the third antiphonalsinging request, and record singing performed by the second user basedon the part of the other part of the lyrics than the first lyrics partaccording to the paragraph information with taking the first target fileas accompaniment audio, to obtain a third recorded file; determine asecond lyrics part and add identification information for identifyingthe second lyrics part in the paragraph information, wherein the secondlyrics part is the part of the lyrics already sung by the second user;perform audio mixing processing on the third recorded file and the firsttarget file, to obtain a third target file; and send the third targetfile and the paragraph information to the server, wherein a third clientacquires the third target file and the paragraph information from theserver and provides a karaoke service to a third user based on theparagraph information with taking the third target file as accompanimentaudio.